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Title of the Invention 



Model Predictive Control (MPC) System Using DOE Based Model 



Field of the Invention 



The present invention relates to a model predictive control system that 



10 uses a statistical model and more particularly but not exclusively to such a 
system using a DOE-based statistical model for example in a manufacturing 
control system that uses empirical data as part of an automatic process control 
(APC) model. 



Automatic process control as a means of controlling the conditions 
under which a process is c arried out is well known . For many years, simple 
and then steadily more complex closed loop control has been introduced. The 
control loop uses a formula or model to relate a process output to one or more 
20 inputs and, as the output varies, feedback is used to alter the inputs to ensure 
that the output stays on track. 

Certain processes, however, require large numbers of variables having 
complex relationships therebetween to be incorporated into a model for 



Background of the Invention 
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effective control. In particular some of the variables involved may be variables 
that are not changeable by a feedback signal, such as the quality of an input 
product. In such circumstances a fully comprehensive model is difficult to 
build. Such a fully comprehensive model may be particularly useful in the 
5 eveni that extremely high quality is required in the resulting product. An 
example is silicon wafer production. 

A model is essentially an educated guess as to the relationship between 
an output and one or more system inputs. The model is required to predict the 
behavior of the process under different input parameters. Accurate prediction 

10 is required if the process is to be controlled to produce desired results. Thus, 
methods of deriving a model may be referred to as prediction methods. 

A particularly useful group of prediction methods comprises what are 
known as empirical prediction methods. In empirical prediction methods, 
existing process data, that is to say actually measured inputs and outputs, are 

1 5 utilized to define the model. Different methods use different ways of analyzing 
the data and incorporating it into a model to arrive at a prediction of an output 
for any given set of input parameters. The term "data" is used herein to refer 
inte lJt lia t0 the QwitiGcttiori of any observable parameter regarding the 

process. 

20 Applicant's previously filed application no. USSN 09/689,884 concerns 

a manufacturing control system, known as a process output empirical modeler 
(POEM) that uses an empirical prediction method to provide a model as a basis 
for APC to operate a process, in particular a factory-based production process, 

2 
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The model divides both input and output parameters into discreet sections, 
builds vectors of all reasonable combinations of the different discrete sections 
of the input parameters and uses empirical data to associate each of the vectors 
with a statistical average of actual outputs corresponding to the given vector. 
5 The vectors, with their corresponding results are then placed in the form of a 
lookup table and used in APC as part of a control process that optimizes the 
inputs that can be varied, in the light of the inputs that cannot be varied, to 
arrive at a desired result. 

Another empirical method that may be used is the method of 
i=J 10 classification and regression trees, CART. The skilled person will be aware of 

Nl numerous other methods that make use of empirical information and to which 

CP the present considerations are applicable, such as CHAID, and Neural Nets. 

M. A disadvantage of the above system, and indeed of any system requiring 

s statistically significant empirical data, is that it requires relatively large 

fy 15 amounts of data before it can begin to rim effectively. Furthermore, it i$ not 

p sufficient to have a large quantity of data. It is additionally necessary to have a 

S good scatter of data across the input space. Certain parts of the input space 

may be utilized only rarely and it may require a very large number of 
experiments to effectively fill rarely used parts of the input space, In the case 
20 of POEM for example, each input vector should preferably have a statistically 
significant set of outputs that can be processed to provide a meaningful average 
output for the given vector. 

The data to be relied upon may often be user specific, as different 
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manufacturers, even if making the s&me product., may often insert their own 
proprietary variations to the process, or may use input materials from different 
sources, which input materials may behave slightly differently in the process. 
Manufacturers are not generally willing to provide data sets to their 
5 competitors, and system manufacturers generally do not carry out the process 
and thus do not have their own independent data sets to sell along with the 
system. 

For all of the above reasons, providers of the system are generally 
unable to provide meaningful datasets with the systems, 

10 Thus each new purchaser of a system is required to develop his own data 

set, and until he has done so the system cannot be used effectively. The 
number of process results required to provide statistically significant coverage 
of the entire input space is often very high, especially where there are large 
numbers of parameters involved, Depending on the process, individual 

1 5 experiments may be expensive or time-consuming or both. 



The present invention is concerned with the problem of providing a 
robust model that may allow operation of the system prior to the user having 
20 been able to obtain a statistically significant data set. 

According to a first aspect of the present invention there i s thus provided 
a system for automatic process control comprising an empirical prediction 
model of a process having an input space comprising input boundaries, the 



Summary of the Invention 
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model requiring empirical data, and wherein at least some data for the 
empirical prediction model is simulated data 

Preferably, the simulated data is data obtainable from a first formula 
describing the process. 
5 Preferably, the first formula is obtainable by regression from a data set 

of experimental results of the process run at least at its input boundaries. 

Preferably, the simulated data is obtainable from said first formula at 
desired points across said input space. 

Preferably, the data set comprises the results of experiments having 
10 input conditions and whose input conditions are definable by a geometric 
spacing of said experiments across said input space. 

Preferably, the geometric spacing is selectable to give an even spread of 
experiments across said input space. 

Preferably, the geometric spacing is selectable to cover at least the input 
15 boundaries and a center of the input space. 

Preferably, the geometric spacing is in accordance with DOE 
predetermined placing rules. 

Preferably, the first formula is any one of a group comprising a linear 
formula, a linear formula with interaction between inputs, a quadratic formula 
20 and a quadratic formula with interaction between inputs. 

Preferably, the input space is divisible into discrete regions, and wherein 
said empirical model comprises predicted process outputs associated with each 
discrete region. 
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In an embodiment, the outputs are process outputs, 
Preferably, results for said predicted outputs are producible by said first 
formula and obtainable from running said process, said results being 
interchangeable within said empirical model 
5 The system preferably further comprises an empirical results quantity 

assessor for interchanging results produced by said first formula with results 
obtained from running said process when said results obtained from running 
said process are assessed to be statistically significant according to at least one 
predetermined criterion of significance. 

10 Preferably, the system has a prediction quality assessor for 

interchanging results obtained by running said process with results obtained 
from said First formula when a prediction of said empirical model is assessed to 
diverge significandy from an outcome of said process according to at least one 
predetermined criterion of significance. 

15 The system preferably comprises a process model using data and a data 

model for generating data for said process model and an empirical data 
extractor for extracting data from said process for said model. Preferably, said 
data used by said process model is interchangeable between data obtained by 
said data model and data obtained by said extractor. 

20 Preferably a prediction quality assessor interchanges results obtained by 

said extractor with results obtained from said data model when a prediction of 
said process model is assessed to diverge significantly from an outcome of said 
process according to at least one predetermined criterion of significance. 
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A preferred embodiment comprises an empirical results quantity 
assessor for interchanging results produced by said data model with results 
obtained by said extractor when results obtained from running said process are 
assessed to be statistically significant according to at least one predetermined 
5 criterion of significance. 

Preferably, said process model is a lookup table. 

The lookup table preferably comprises output values for discrete regions 
of an input space within which said process is operable. 

The data model is preferably a formula obtainable from outputs of said 
10 process associated with geometrically spaced points of an input space within 
which said process is operable. 

Preferably, said formula is any one of a group comprising a linear 
formula, a linear formula with interactions, a quadratic formula and a quadratic 
formula with interactions. 
15 Preferably, geometrically spaced points are evenly distributable about 

said input space. 

Preferably, said geometrically spaced points comprise points placed on 
the boundaries of said input space and a point placed at a center of said input 
space. 

20 According to a second aspect of the invention there is provided a 

method of automatically controlling a process, using a data-based process 
model comprising the steps of 

generating data for said process model using a data generation formula. 
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and 

controlling said process using said generated data in said process model. 
Preferably, the process has an input space and said data generation 
formula is obtained by running said process at preselected points in said input 
5 space. 

Preferably, the preselected points are orthogonally placed in said input 

space. 

Preferably, the preselected points axe evenly spaced in said input space. 

Preferably, at least some of said preselected points are placed at 
1 0 boundaries and a center of said input space. 

A preferred embodiment further comprises a step of replacing said 
generated data with data empirically obtained during the running of the 
process, 

Preferably, the step of replacing said generated data is carried out when 
15 said data obtained empirically has reached a threshold of significance 
according to at least one predetermined significance criterion. 

A preferred embodiment comprises a further step of reverting to data 
generated using a data generation formula. 

Preferably, said step of reverting is carried out when results predicted by 
20 said data-based process model are detected to diverge from empirically 
measured process results by an amount exceeding a threshold of significance 
according to at least one predetermined significance criterion. 

A further preferred embodiment uses an evolutionary approach, to find 
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an optimal operating region. The embodiment begins operating at a first input 
space, and if unsatisfied with the results achieved finds a further input space. 
Output results obtained in the second input space are compared with the first 
and used to determine whether or not to move to a new operating area or input 
5 space, Such an embodiment comprises the steps of 
building a formula for a first input space, 
obtaining process output data for said first input space, 
building a formula for a second, input space, 
obtaining process output data for said second input space, 
10 comparing said process output data for said second input space with 

process output data for said first input space, 

on the basis of said comparison selecting a third input space for 
obtaining process output data> 

and operating said process in an optimal one of said input spaces. 

15 

Brief Description of the Drawings 
For a better understanding of the invention and to show how the same 
may be canied into effect, reference will now be made, purely by way of 
example, to the accompanying drawings. 
20 With specific reference now to the drawings in detail, it is stressed that 

the particulars shown are by way of example and for putposes of illustrative 
discussion of the preferred embodiments of the present invention only, and are 
presented in the cause of providing what is believed to be the most useful and 
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readily understood description of the principles and conceptual aspects of the 
invention, In this regard, no attempt is made to show structural details of the 
invention in more detail than is necessary for a fundamental understanding of 
tne invention the description taken with the drawings making apparent to those 
5 skilled in the art how the several forms of the invention may be embodied in 
practice. In the accompanying drawings: 

Fig. 1 is a simplified flow diagram of a process for building a useful 
model to simulate empirical data, 

Fig. 2 is a simplified screen display for use in defining inputs and 
1 0 outputs for preparation of a model in the process of Fig. 1 , 

Fig. 3 is a simplified screen display for use in defining a;type of model 
in the process of Fig. 1 , 

Fig, 4 is a simplified screen display showing a table of experiments that 
may typically be recommended for building a model in the process of Fig. 1, 
15 Fifr 5 is a simplified flow diagram showing the continuation of the 

process of Fig. 1, and detailing the bui lding of a model for simulating empirical 
data followed by use of the data for building a model for automatic process 
control. 

Fig. 6 h a screen display showing a typical model for simulating 
20 empirical data that may be produced by the process of Fig. 5. 

Fig. 7 is a typical screen display showing various statistical tools and 
data that may be used to study a formula such as that of Fig. 5, 

Fig. 8 is a simplified graph showing evolutionary operation of DOE over 
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an input space, 

Figs 9 - 13 are simplified graphs illustrating DOE rules for defining the 
numbers and contents of experiments, and showing respectively two level two 
factor, three level two factor, three level three factor, two level, two factor 
5 central composite design and three level two factor central composite design, 



Before explaining at least one embodiment of the invention in detail, it 
is to be understood that the invention is not limited in its application to the 
10 details of construction and the arrangement of the components set forth in the 
following description or illustrated in the drawings. The invention is applicable 
to other embodiments or of being practiced or carried out in various ways. 
Also, it is to be understood that the phraseology and terminology employed 
herein is for the purpose of description and should not be regarded as limiting. 

15 In the preferred embodiment of the present invention, a process or 

process step is taken and defined in terms of significant input and output 
parameters. The parameters are then related using a model type, for example 
linear, linear with interactions, quadratic, and quadratic with interactions . The 
number of parameters plus the model type preferably allows the calculation of 

20 how many experiments should be performed. Subsequently the system 
suggests input parameters for each of the experiments in such a way that result 
data representing the entire spread of the input space may be efficiently 
obtained. Essentially the process is operated in different combinations of its 



Description of the Preferred Embodiments 
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extreme operating points for each of the input parameters in such a way as to 
permit extrapolation across the entire input space. 

The user is then encouraged to carry out the recommended experiments 
and record the results. The results are processed using a mathematical method 
5 called regression to expand the .results from the extremes to cover the entire 
input space in a meaningful manner. The expanded results are then used as the 
raw data in an empirical method such as POEM. 

In this way the empirical method may be used after only a small number 
of results have been obtained, depending on the number of inputs and the way 
10 in which they are related. The generation of a full set of empirical data results 
with statistically significant data for all sectors of the input space, by contrast, 
may take a significant length of time. 

Reference is now made to Fig, 1, which is a simplified flow diagram 
.showing a procedure for obtaining data suitable for empirical prediction 
15 methods, operable in accordance with a first embodiment of the present 
invention. 

In Fig. 1, a process step is first defined in terms of a series of inputs and 
outputs. An input is given a name, and is defined to be of any one of a number 
of types. The types may be "measurable", meaning that the input can be 
20 measured by the automatic process control (APC) but cannot be altered by it. 
Alternatively, the input may be of type "controllable", meaning that not only 
can it be measured but that it can be controlled by the APC. The input is then 
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associated with a unit type. For example, fine length measurements may be 

defined in terms of angstroms or microns. 

Each of the inputs may then be associated with a measurement range. 

The range is preferably defined in terns of a maximum expected or permitted 
5 level and a minimum expected or permitted level. 

If there is more than one output associated with any given input, then the 

input may be associated independently with each of two different output levels. 

This is for example achievable by associating the input with different 

weightings for each of the associated outputs. 
;5J 10 Likewise, one or more outputs are defined, again in terms of a 

I? measurement unit and a range. The range may be inserted with upper and 

lower limits as was done with the inputs or alternatively a simple range having 
^ a numerical value but not having upper or lower limits may be entered. 

- Reference is now made to Fig. % which is a simplified diagram showing 

Ft! J. 5 the layout of a computer screen for defining a process step as described above. 

01 the screen shows definitions being entered for a step, step 1, of a process, 

□ 

Q process L The step comprises a first input, measl, of type "measurable" 

having units of microns, and a second input, contrl. of type "controllable", 
having units of angstroms. Lower and upper range limits are not shown . 
20 Likewise, the step comprises two outputs, outl and out2, having units 

respectively of microns and angstroms, and ranges respectively of 300 microns 
and 400 angstroms. 

Returning now to f ig. 1, and the step of defining a range for each input 
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and for each output is followed by a step of defining a relationship between the 
inputs and the outputs. The types of relationships that may typically be 
available are summarized below in table 1. Other types of relationships are 



possible, as the skilled person will be aware. 





Model 


Description 




Linear without 


Shown below is a simplified example of a linear model without interactions. 




i nteraeuons 








Y = ba + UXi + b 2 Xz + W(s 






1 _^ 1 






This type of model entails the smallest number of experiments. The fewer 






the experiments, the less accurate the model will be. To be used only when 






neither quadratic behavior nor i me nations are expected. 




Linear with 


Shown below is a simplified example of a linear model with interactions. 




interucvions 








Y = bofbiX» +b3Xi-rbiX 3 -r-buXi X2 + bi 3X1X3+1^X2X3 
1 1 ! 1 


Nl 




SJ 1 *— " V ' 

Linear interactions 


JE 




Use only when quadratic behavior U not expected. 


fn 


Quadratic without 


Shown below is a simplified example of a quadratic model without 




inlei actions 


interactions. 






Y = bo + b| X; + b 2 X : +1*3X3 + br, X, 2 + b 22 Xl+ b»xS 






1 — ^ 1 L— , i 

Linear Quadradic 






For use Wily When quadratic behavior is expected, but without interactions. 




Quadrutis with 


Shown below is a simplified example of a quadratic model with interactions. 




interactions 


Y-bto + bX, +b 2 X 2 +b3X 3 +b l2 X, X 2 + bi 3 X,X 3 + foXaXa 










1 y 1 1 v 1 

Linear Interactions 






, + bi 1 X\ + bj2X2+ ^3X3 






1 V ' 

Quadratic 






Fcr use when expecting quadratic behavior with interactions, or when unsure 






of the model behavior. This model requires the moid experimentation, and 






provides the most accurate model. 



5 Table 1 - Typically available model types 



As shown in table 1, a linear model without interactions is the simplest 
type of relationship between the inputs and the outputs. The model requires the 
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least amount of experimentation to provide statistically significant data. The 
most complicated model type shown is the quadratic relationship with 
interactions. It requires the largest number of experiments in order to obtain 



sufficient data to determine the model coefficients. The quadratic relationship 
5 with interactions is a particularly preferred mathematical approximation for a 
typical process and is the model type that is ideally selected when nothing is 
known about the relationship to be modeled. 

In table 1, linear and quadratic models only are shown. It is also 
possible to use cubic and higher order models should the process to be 
10 controlled require it. However, cubic and higher order models are used only 
rarely in process control. 



Following the selection of the most appropriate model type* it is possible 
to suggest a recommended number of experiments. The recommended number 
is calculated using predetermined rules based on the selected model type and 

15 the number of inputs, line way in which the number of experiments is 
calculated is part of the DOE procedure and will be described in greater detail 
at the end of this description with reference to Figures 9 to 1 3 , 

Reference is now made to Fig. 3 S which shows a generalized screen 
display for selecting a model. In the display, the four available model types are 

20 shown, and the desired selection is highlighted. Under a heading 
"Experiments" is shown a recommended number of experiments that has been 
calculated based on the selected model type and the number of inputs defined 
earlier. In this case, the recommended number shown is 1 8. 



15 
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Returning again to Fig. 1, and the system, now recommends input values 
to be used for each of the recommended experiments. The input levels are 
selected so as to he scattered in an effective manner around the whole space. 
This includes running the process at its extremes for each of the input variables. 
5 The results ars preferably provided as a table, each row representing a different 
experiment. Such a table is shown in Fig. 4 P with spaces for entering output 
values. 

The process is preferably carried out for each one of the recommended 
experiments and the results obtained are entered into the table. 
10 Reference is now made to Fig, 5, which is a simplified flow diagram 



showing how the data obtained from the experiments described with reference^ 
to Fig. 1 may be used to form a lockup table for the POEM empirical method. 

In Fig. 5, a mathematical stage of regression is carried out to calculate a 
formula that describes the process output in terms of the inputs. More 

15 precisely, regression involves the taking of the formula of the model type 
selected previously and. deriving coefficients for each of the terms so as best to 
fir the input data with the observed outputs. Fig. 6.shows a typical series of 
coefficients that may be generated in this way. 

Once the formula is available, it may be used to build a full statistical 

20 results database by calculating outputs across the whole of the input space. As 
mentioned above, DOE experiments are preferably carried out at least at all of 
the extreme points of operation of the process. Thus preferably none of the 
output calculations involves extrapolation of data outside the area in which 

16 
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experimentation has been carried out, giving additional validity to the results. 

Reference is now briefly made to Fig. 7 3 which is a simplified diagram 
showing statistical data which may be used to verify the quality of the formula 
obtained, The use of the data shown is apparent to the skilled person and is not 
5 described in further detail. 

In the following, the use of the results database for POEM is explained, 
although it will be appreciated by the reader that the use of the statistical results 
database is not restricted thereto, and is particularly applicable to other 
empirically-based methods such as CART referred to above. 
~ 10 Each of the Input ranges is divided into a plurality of discrete regions 

~j[ and the input variable is henceforth described in terms of the discrete regions 

4* 

ffj into which it has been divided. For n input variables, the input space is now 

M= describabie as a series of n-dimensional vectors which as a whole cover all 

s input combinations. Each vector entry preferably lias a corresponding series of 

f=5S 

fy 15 results, so that an average can be taken of the corresponding results to produce 

fl \ 

m an average for the vector. Annealing is preferably carried out to improve the 

precision of the process. 

The vectors thus produced are incorporated into a lookup table for 
control of the process and the process is controlled by reference thereto for 
20 selection of the best recipe to control the process. Such a best recipe i$ 
searched for which differs from a current recipe in that the output is more 
favorable (meaning closer to the target value), the measurable variables are the 
same, and only the controllable variables differ. 
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Iii a preferred embodiment, once the system begins to operate using data 
obtained according to the above procedure, actual process data is obtained. 
The actual data may be steadily accumulated into the database or the user may 
prefer to wait for a period of time until he has a statistically significant quantity 
5 of data 

In trie following example, use of the invention for improving the yield in 
a semiconductor manufacture process, is described. 

In the process, a silicon wafer is sharpened, that is to say an outer layer, 
perhaps an oxide layer, is removed. 

J 0 The process involves the following input parameters; 

• The thickness of the wafer before the process starts 

• The temperature of the machine while sharpening. 

• The pressure within the machine whilst sharpening. 

The process has a single output parameter, namely the thickness of the 
1 5 wafer foil owing processing. 

The aim of applying th e above -described embodiment to the process is 
to achieve a precise thickness at the end of the process although the relationship 
between the process's parameters result are initially unknown. 

According to the DOE methodology a model type is selected, A number 
20 of experiments is set and input values for each of the experiments are 
calculated. Then mathematical regression is used to calculate a model, such as 
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the following: 

ThicknessAfter = 0.2*Thickn8SsBefore - 0>09*Temperature + 
USVressure - 38 

Using this model we now build a lookup table according to the POEM 
5 algorithm. 

In the POEM algorithm each input variable is firstly divided into sub- 
intervals, for example: 



Variable Name 


Interval 


Number 
of Sub- 
Intervals 


Sub-Intervals 


Thickness 


300-650 


4 


A: 300 - 387.5 
B: 387.5-475 
C: 475 -562.5 
D: 562.5 -650 


Temp 


200-400 


2 


A: 200-300 
B: 300-400 


Pressure 


68-97 


3 


A; 68-77.6 
B: 77.6-87.3 
C: 87.3-97 



Table 2 Division of Input Ranges Into POEM Discrete Units 



The input space now comprises a series of the possible combinations of 
10 the input sub-intervals. Each combination is written as a vector and each of 
th ese input vectors defines a finite region of the input space. 

For each vector, the model (the formula) is now used to calculate a 
corresponding predicted output value of the process. 

In applying the model to each vector, preferably a midpoint in the vector 
15 is taken to apply values to the formula. A results set as follows may be 
produced. 
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Input Vector 

{Thickness, 
Tenip, Press} 


Center of the Vector 
Spaed, used as input to 
tbe DOE model 


Predicted output 
according to the DOE 
model 


{A, A, A} 


{343.75, 250. 72.8} 


113.8 


{A.B.C} 


{343.75. 350.92.15} 


132.9 


{D.A.C) 


{606.25.250.92.15} 


194.4 



Table 3. POEM Results Set Using DOE Data 



In the wafer example, for the inputs given, the table may contain 4*2*3 
= 24 cells in the lookup table. 
! 5 From now on, APC may proceed in the normal way using the POEM 

: methodology as if the vector set represents genuine empirical information. 

While using the synthesi2ed data for APC, every run of the process 
provides experimental data for collection which is genuinely empirical, The 
data being collected can be used for improving the prediction ability of the 
10 empirical method that is being used. 

As a rule, the prediction ability of an empirical model improves as the 

i 

amount of observed data increases. 

Updating of the data set from DOE data to genuine empirical data is 
shown for purposes of example in conjunction with POEM methodology, but 
15 the skilled person will be aware that it may be implemented for any other 
empirical method. 

Returning to the above example of wafer sharpening and as previously 
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mentioned with reference to table 3, a POEM lookup table was created using 
data from the DOE formula. 



Table 3 is here partly repeated for convenient reference: 



Vector 

{Thickness, Temp, 
Pressure} 


.Predicted output according to the 
calculated (DOE) model 


{A, A, A} 


113.8 


{A,B,C} 


132.9 


{D,A,C> 


194.4 



Table 3 Partially Repeated for Convenient Reference 
Acaial results obtained during running of the process are preferably 
stored in a separate but corresponding table, exemplified by Table 4 below. 
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Values: {Thickness, Temp, 
Pressure) 


Observed output result 


{604,260,93} 


193.8 


{580255,89} 


192.1 


{634,290,93} 


195.0 



Table 4 Empirical Results 



The above three sets of values are values which belong to the {D, A ? C} 
vector group of table 3 above. In each case> all of the results corresponding to 
y 5 each respective input vector are taken and an average or equivalent statistic 

Nj calculated. As there are now a number of observed results that belong to the 

01 same input vector in the lookup table, it is possible to update the 'output' 

M= column belong to that vector to be the average of the respective observed 

s results. In this case the average may be calculated as follows: 

g J93.8 + 192.1 + I.9S _ m6 

ry 1° 3 which differs slightly from the previous 

On 

Pj expected output (194.4) obtained using DOE. 

hi addition to POEM there are many other empirically based 
mathematical systems that may be used. Furthermore there are numerous 
methods, in addition to those already described, for incorporating newly 
15 obtained empirical data into the model. For example, when obtaining 
information, it is possible to enter the data directly into the current lookup 
lable. Alternatively it is possible to wait until a statistically significant sample 
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ha$ been obtained before altering the look up table. In deciding when to 
incorporate newly obtained empirical data it should be borne in mind that the 
result of a single process may be unrepresentative, and it is preferable to avoid 
using questionable, that is not statistically meaningful data in the look up table. 

5 The look up table is being used directly to control the process and thus use of 
statistically questionable data could, as a worst case, lead to instability in the 
process being controlled. 

In a preferred embodiment of the present invention an APC is able to 
swiich automatically between DOE data and empirical data. Not only is it able, 

10 as described above, to replace DOE data with empirical data when there is 
statistically significant empirical data to be used, but additionally or 
alternatively, it is able to detect wandering of the actual process outputs from 
the predictions of the lookup table. Preferably the amount of wandering is 
tested for statistical significance. Additionally or alternatively it is tested 

15 against a threshold. The threshold may be a statistical threshold. If the process 
is found to have wandered beyond acceptable limits then the current empirical 
data may be discarded and the DOE process is begun again. Following the 
building of a DOE based look up table a new set of empirical data is obtained. 



20 previously obtained DOE formula is reverted to. 

In a further embodiment of the present invention, alternative DOE and 
empirical processes are built into an APC device. The device is installed and 
automatically sets itself to carry out a DOE set of predetermined experiments 



In a further embodiment, instead of beginning the DOE process again, a 
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as described above in. an initial learning process. Daia obtained from the 
formula is then automatically made into a lookup table as described above, 
without user intervention and used for APC during a secondary learning 
process. During the secondary learning process empirical data is obtained, 
5 again automatically, and incorporated into the lookup table in one of the ways 
described above to replace the DOE data in a filial phase in which, preferably, 
learning continues to occur as the empirically obtained data is continuously 
updated in the light of more recent results. Additionally, the embodiment may 
incorporate automatic return to DOE following significant wandering of actual 

O 

."S 1 0 results from predicted results, as described above, 

J In the above pre4nstalled embodiment, preferably the process type is 

known and thus the model type, linear, quadratic etc may be preset. Thus the 
maximum user intervention that is necessary is preferably arranging the 

H settings for the DOE defined experiments. 

RJ 15 Reference is now made to Fig, 8, which is a simplified graph of a two- 

fjl dimensional input region 80, illustrating a mode of using DOE known as 

□ 

rj evolutionary operation. Two inputs I] and I 2 define between them a total input 

space 80. Within the total input space 80, a smaller input region 82 is used in a 
first set of DOE experiments. The smaller region 82 may have been chosen 
20 because it is believed to produce the best results or because it is easiest to .run 
experiments in this region or because resources for carrying out the 

i 

experiments are limited, or for any other reason. In particular, a reason may be 
a decision on methodology of searching the input area since carrying out DOE 
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on the total area 80 may not detect, or may not be good at detecting, subtle 
variations in output in localized parts of the region SO* 

DOE results for input region 82 are thus obtained and used in the normal 
way. Later on, perhaps because unsatisfactory outputs are achieved, it is then 
5 decided to investigate a second, preferably contiguous region 84. The second 
region 84 may yield results which are better, worse or indifferent. If the results 
are better then it will bt clear that region 84 is a better region within which to 
run the process than region 82. Furthermore, if there is an improvement in 
yield when moving in the direction of region 84, then it makes sense to 
4j 10 investigate further regions contiguous to region 84. For example, region 86 

^ may be searched. Again, if region 86 produces an improved yield then it 

9 1 makes sense to investigate farther regions contiguous to region 86, such as 

M= region 88, until no further improvement in the yield i s found. 

-JG 

h It wilJ be appreciated that the .investigation of contiguous regions is 

fy 1 5 susceptible to ending at a local maximum, and thus in a preferred embodiment 

oR investigation is not limited to contiguous regions. A disadvantage of this 

approach however i s in the cost of an increased number of experiments, 

Reference is now made to Figs. 9-13 which are graphs of typical input 
regions for DOE analysis. The graphs illustrate the experiment placing 
20 strategies that DOE uses in order to obtain meaningful information about the 
input space. 

Fig, 9 shows a simple two-input system. The A and B axes represent the 
two inputs arid a square 90 represents the input space that is of practical interest 
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to the experimenters. For each input, experiments, indicated by circles 92, are 
carried oat for minimum and maximum levels of that input. A total of four 
experiments is carried out to obtain what is known as a 2-level full factorial 
result, that is to say wo levels for each input. 
5 Reference is now made to Fig. 10, which is the same as Fig. 9 except 

that three levels are studied for each one of two inputs to give a three level fall 
factorial result In general, for k level full factorial the number of experiments 
needed is 
k fl , 

10 where n is the number of inputs or factors. As will be appreciated, the input 
~2J conditions for each experiment are as defined by its position in the graph, 

y As well as full factorial combinations, it is also possible to use fractional 

M 1 factorial combinations. A fractional value is selected such as V4, and this 

s enables a significant decrease in the number of experiments at the cost of losing 

fU ] 5 information regarding interactions between the different inputs or factors, 

rp Reference is now made to Fig. 11, which is a graph showing a three 

Fj teve.1 flill factorial design for three inputs. A total of 3 3 » 27 experiments are 

recommended under evenly distributed input conditions. 

Reference is now made to Fig. 12, which is a simplified graph showing 
20 what is known as central composite design (CCD). CCD is efficient for 
calculating quadratic models, and comprises taking experimental points 
calculated according to the full or partial factorial model as required, and 
adding to this a center point and two radial or star points foT each factor. 
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In a preferred embodiment, instead of taking a single center point, two 
centrally located points are selected. 

In Fig. 12 which has two factors, four radial points are taken, and in Fig. 
13, which shows the equivalent case for three factors, six radial points are 



DOE thus preferably provides an orthogonal array of points 
geometrically arranged over the input space, The arrangement may be such as 
to cover the input space evenly or may use a CCD arrangement. If a 
boundaries only version such as that of Fig. 9 is used for a linear model, it is 
10 recommended to take two central points as well to confirm that the process is in 
fact linear. 

In carrying out DOE, it may be borne in mind that the more experiments 
that are carried out the more accurate the resulting model may be expected to 
be s on the other hand the longer and more expensive the experimental setup 
15 stage becomes. Thus a DOE routine attempts to strike a balance between 
accuracy and efficiency in selecting the number of experiments. Fitrthennore 
the user may choose not to carry out some of the recommended experiments. 
Generally, it will still be possible in such a case to generate a model, although 
accuracy will be lost. 

20 According to the above-described embodiments, APC methods using a 

process model for control, and requiring empirical data to build toe model, 
which data is difficult, expensive or time consuming to obtain, may be fed with 
useful data after a minimal number of experiments in a reduced learning period, 



5 



taken. 
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using rigid experimental design techniques such as DOE. Embodiments may 
gradually or otherwise replace the calculated data with actual empirical data as 
it becomes available and certain preferred embodiments may revert to existing 
or newly obtained DOE data when it becomes apparent that a process being 

5 controlled has wandeTed from an existing process control model. 

It is appreciated that certain features of the invention, which are, for 
clarity, described in the context of separate embodiments, may also be provided 
in combination in a single embodiment. Conversely, various features of the 
invention which are, for brevity, described in the context of a single 

10 embodiment, may also be provided separately or in any suitable 
subcombination. 

Ir will be appreciated by persons skilled in the art that the present 
invention is not limited to what has been particularly shown and described 
hereinabove. Rather the scope of the present invention is defined by the 
.15 appended claims and includes both combinations and subcombinations of the 
various features described hereinabove as well as variations and modifications 
thereof which would occur to persons skilled in the art upon reading the 
foregoing description. 
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